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ABSTRACT 

The purposes of the present investigation are to 
evaluate a new, adaptive procedure for assessing multiple dimensions 
of self-concept for children younger than 8 years and to examine 
related theoretical issues. The multidimensional, hierarchical 
structure of self-concept is well established for older children, but 
there is a paucity of research and appropriate instruments for very 
young children. A limited amount of research suggests that 
self-concept is poorly differentiated and that a general self-concept 
raay not even exist. A sample of 501 students in kindergarten and 
first and second grades in Sydney (Australia) completed a variation 
of the Self -Description Questionnaire I (SDQI) using a new individual 
interview technique. At each grade level, confirmatory factor 
analyses identified all eight SDQI scales, including the general 
self-concept scale. With increasing age, the fit of the eight-factor 
model improved, and the size of correlations among the eight SDQI 
scales decreased, implying that self-concept becomes more 
differentiated with age. The results demonstrate that appropriately 
measured self-concepts are much better differentiated for very young 
children than previously assumed. A 55-item list of references and 
seven data tables are included. (Author/T«JH) 
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Sel-f-conc^ts Of Young CJiildren Aged 5 to 8: 
Their Measurement and Multidimensional Structure 

ABSTRACT 

The purposes of the present in»/estigatian are to evaluate a new, adaptive 
procecLire for assessing multiple dimensions of self^ncept -for children 
younger than 8 and to examine related theoretical issues. The 
multidimensional, hierarchical structure of self^ncept is now well , 
established for older children but there is a paucity of research and 
appropriate instruments -For very young children. A limited amount of 
research suggests that sel-f-concept is poorly differentiated and that a 
general self-concept may not even exist. 501 students in kindergarten, 1st, 
and 2r.d grades conpleted a variation of the SDQI using a new individual 
interview technique. At each grade level confirmatory factor analyses 
identified all 8 SHQI scales — including the General self-concept scale. 
With increasing age the fit of the 8-factor model improved and the size of 
correlations among the 8 SDQI scales decreased, implying that self-concept 
becomes more differentiated with age. The results demonstrate that 
appropriately measured self^-concepts are much better differentiated for very 
ycxing children than previously assumed. 
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A positive self-concept is valued as a desirable outcome and as a 
potential mediating influence leading to other desired outcones such as 
acadanic achievement. Despite the thousands of sel-F-concept studies 
condicted with elder students, there has been littlR research conducted with 
children below the age of 10, This is unfortunate as this developmental 
period may be critical in the formation of a positive self-concept — 
particularly in scLcational settings. This lack of research stems apparently 
from the dearth of instruments appropriate for measuring self-conc^ts for * 
children of this young age. 

Prior to the 1980s reviews of self-concept research based on .-£_Honses 
by older children and young adults iioted a lack of theoretical models and 
appropriate measuremsnt instruments. The Shavelson, et al. (1976) model of a 
self-concept was valuable in remedying some of these problems. They proposed 
self-concept to be a multifaceted, hierarchical construct that became 
increasingly distinct with age. In their model a general facet at the apex 
of the self-concept hierarchy is divided into academic and nonacademic 
components of self-concept. The nonacademic conponent is divided into 
physical, social, and emotional components, whereas the academic component 
is divided into aca iamic facets in partioalar subject areas &ich as English 
and mathem;itics. Particularly during the last decade, as researchers have 
developed apparently better self-concept instruments based on stronger 
theoretical models, support for the multidimensional ity of self-concept for 
older children and young adults has become well established (e.g., Byrne, 
1984; Dusei^ 8: Flaherty, 1981; Fleming & Courtney, 1984; Harter, 1982; Marsh, 
Byrne & Shavelson, 1988; Marsh, in press-a). 

Harter <1983, 1985, 1986) evaluated alternative models of self. Like 
Shavelson, et al. <1976), she rejscted models depicting self^concept as a 
simple suffriiated score based on items reflecting different domains (e.g., 
Cloopersmith, 1967) and argued for a multidimensional perspective that 
recognizes specific domains such as the physicfl, social, and academic 
facets of self. She also argued for the usefulness of a relatively 
unidimensional, global self^ncept like that described fay Rosenberg (1979). 
In discussing the Rosenberg approach to global self-vcrth she noted that it 
"finesses the complexities of the underlying hierarchy of discrete Judgments 
that may be responsible for such an overall Judgment about the self" (Harter, 
1986, p. 141). Combining these two perspectives, Harter recc.'wnsnded a hybrid 
approach that includes both content specific scales and a global scale. 
Empirical support for a multidimensional, hierarchical self-concept 
O proposed by Shavaison et al. (1976) and for the hybrid ,-ipproach proposed by 
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Harter is particularly strong in research using the Self Description 
Ckiestionnaire (SDQ) instruments (see Marsh, 1989, in press-a, for an 
overview). Three SDQ instrumc^nts designed for children of differing ages are 
based on the Shavelson et. al. rrodel. In research using the SDQI with young 
children, limitations in children's ability to respond to questionnaires has 
been overcome in part by reading aloud the SDQI items. Using this approach, 
Marsh, Barnes, Cairns and Tidman a9&i) tested large samples of students in 
gractes 2-5 Even for the second grade children, the SDQ factor structure wa^ 
reasonably well defined and the internal consistency estimates for the SDQI 
scales were moderate. In subsequent confirmatory factor analyses (CPAs) 
using the LISREL approach to testing factorial invariance. Marsh and Hocevar 
(1985) found that the SDQI factor structure (factor loadings) were 
reasonably invariant across the different ages. Consistent with the 
Shavelson et al. (1976) hypothesis that sel-f-conc^t bscomes more 
differentiated with age, they found that correlations among the factors 
becama smaller with age. Whereas a similar pattern of hierarchical factors 
was identified at each age, the self-concept factors became more distinct 
and tlie hierarchy became weaker with age. 

The purposes of the present investigation are to evaluate a new, 
adaptive procedire for assessing multiple dimensions of self-concept for 
children younger than 8 using a variation of the SDQI and to examine 
theoretical issues related to the factorial structure (or dimensionality) of 
sel-F-concept for these young children. The theoretical basis of this 
research derives largely from Harter' s research (1983, 1985, 1986; Harter & 
Pike, 1984; Si Ion & Harter, 1985) that focuses specifically on the self- 
conoepts of very yoing children, the Shavelson, si al. (1976; Marsh & 
Shavelson, 1985) model of self-concept, and previous SDQ research (e.g.. 
Marsh, 1988; Marsh, in press-a; Marsh, Byrne & Shavelson, 1988). The 
development of appropriate measurement devices for children of this age has 
important theoretical and practical in^Dlications. Theoretically, the study 
provides important evidence about tha ability of very young children to 
differentiate specific facets of self-concept and to form a generalized 
conception of ©elf, and about age and gender differences in salf-oonc^ts 
•for very young children. From a practical perspective, the ability to 
measure the self-cqncepts of 'very young children provides an inportant 
outcome measure for for teachers to better understand their students and for 
a wide variety of interventions designed for young children. 

The Self-concepts of Very Young 0)ildren 
Despite a growing consensus in findj .gs for older children, the status 
of the multidimBfnsionality and the factorial structure (or dimensionality) 
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of self-concept far very ysung childr^T-. is unclear. The extent to which 
self-concept is differentiated for young children apparently. reflects the 
cognitive development of the child (Stipek t Mclver, 1989; Si Ion & Harter, 
1985) and the appropriateness of the instrument used to assess self-concept 
(Harter & Pike, 1984; Stipek & Maclvsr, 1989). Whereas nxast researchers 
assume that the dimensions of self-concept become increasingly distinct with 
age, there is limited evidence on the the distinctiveness of self-concept 
factors at di-ffsrent ages (Marsh, 1989). Harter and Pike (1984; Harter, * 
1983, 1986), for example, suggested that for children younger than 8 a 
global sense of sslf^rth does not exist and specific facets of self- 
concept are not well differentiated. Stipek and Maclver (1989) noted that 
very young children have a poorly di-fferentiated concept of acaderiiic 
coo^ffitence but that it becomes better differentiated from other facets 
(e.g., social competence) during elementary school years. They also 
suggested, however, that the lack of differentiation may reflec± problems 
with existing measurement instruments and recommended the use of more 
appropriate assessment procedures. Wylie (in press) also noted the 
difficulties in assessing self-concept in young children, stating that "it 
is generally conceded that preschool children have some descriptive and 
evaluative self-conceptions, but the problem of measuring them is obviously 
a thorny one" (p. 70). Perhaps, as appears to have been the case for 
research with older children, progress in theory and research for very yooriy 
children will be stimulated by the development of better multidime.isional 
measurement instruments. 

In this section we discuss three issues related to evaluating the self- 
conc^ts of very young children that are the basis of the present 
investigation. First, there is a need to evaluate how to measure self- 
conc^t most effectively for very young children. As proposed by Harter 
(1983, 1986; Harter & Pike, 1984) this may require simpli-fied item contents 
or pictorial representations, simplified response formats, and individjially 
based interviews instead of conventional paper-and-penci 1 tests that are 
group administered. Second, there is a need to evaluate the factorial 
structure of self-concept responses for young children and to determine 
whether factors like those identified in the responses of older children can 
be found (e.g., Harter, 1982; Marsh, 1988, in press-a) . Third, there are 
important, unresolved issues surrounding the status of general self-concept 
in very young children. For example, Hartar and Pike (1984) claim that 
general self-concept evolves from the integration of domain-specific facets 
of BBl-ff and thus does not exist prior to the age of 8. 
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Ihe Measurement of Sel-fHajncept For Young Children 

Wylie (in press) has recently updated her classic review of self- 
concept measures (Wylie, 1974; 1979), and was particularly concerned to 
identify suitable instruments designed for very young children. She selected 
l^<^at were apparently the two best of existing instre'/nents. She noted, 
however, that there was insufficient evidence to adequately evaluate either 
instrument, but included them because no more fully developed self-concept 
measures were available for very young children. One of these instruments ^ 
relied on items from a variety of different domains to Infer a global, 
undifferentiated self-concept. Because an important -focus of the present 
investigation is to determine the factoria^l structure of self-concept and 
whether young children are able to differenticite among specific facets of 
self, this instrument is of limited relevance. 

The second instrument considered by Wylie, the Harter and Pike (1984) 
instrument, was designed to measure four self-concept scales: physical, 
cognitive, pe^rs, and maternal. Each scale is defined by six bipolar items 
that are represented by parallel verbal statements and pictures. For 
exarrple, respondents are shown two pictures, one in which the target child 
appears with one ^ther child and one in which the target child appears with 
fiv© other children. The respondent is told that the first target child 
doesn't have many friends to play with and that the other target child has 
lots of friends to play with. The respondent's first task is to indicate 
which target child is most like the respondent. After choosing a targat 
child, the respondents indicate whether they are a lot like the chosen 
target or just a little bit like the target child. For each item there are 
more specific prompts such as do you have "a whole lot of friends to play 
with" or "pretty many" (see Harter & Pike, 1981; 1984). This two-stage 
response format consisting of two dichotomous responses results in a 4-point 
response scale., 

Harter and Pike (1984) emphasized a numLjr of important features of 
this scale including: (a) items appropric*te to the developmental level of 
the children, (b) the pictorial format, and (c) the 4-point response scale 
that provider for a greater range of responses than typical dichotomous 
responses. Although not emphasized b/ Harter and Pike, anotnsr potentially 
important feature is that trtsy administered their instrument individually to 
each child instead of using a group administration procadure. as is typical 
with older children, l^^reas they did not provide details of their 
individial administration proceriures, this procedure offers the possibility 
for the administrator to ensure that the child understand© an item and, if 
necessary, to clarify the meaning of an itism for the child and the child's 
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ERJC typically found -for olds.- children, th-js supporting the -Frequently noted 



response. Of particular relevance to the present investigation, no global 
self scale was included because Harter's theoreticax and empirical research 
indicated that global self-concept does not evolve until approximately the 
age of 8 (Harter & Pike, 1984). 

The Factorial Structure of Self-conceot in Responses B>^ Youno Children 

Predictions about how self-conc^t and its factorial structure evolves 
v^ith age have been proposed from a variety of theoretical perspectives. A 
basic assumption of the Shavelson model is that self-concept beuames more * 
di-fferentiated with agf^,. Markus and Wurf (1987) note that the structure of 
self depends on both the infbrmatior available to an individual and the 
cognitive ability to process this irrfbrmation. Harter (1983, 1985) proposed 
a model in which self-concept becooies increasingly abstract with age. Her 
review of previous research suggested that self^conceptions shift from 
concrete descriptions of behavior in early childioad, to trait-like 
psychological constructs (e.g., popular, smart, good looking) in middle 
chiJdxDod, to more abstract constructs during adolescence. 

Harter (1983, 1985), consistent with her proposal that self-concept 
becomes increasingly integrated with age, reported that the concept of 
general sel-f^^arth doet: not evolve before the age of about 8. Harter and 
Pike (1984) reported that below age 8 children do not understand general 
sel-F-worth items or provide unreliable responses. Subsequent research (Si Ion 
8c Harter, 1985) suggested that mental age may be more important than 
chronological age. Harter's assumption is apparently in direct contradiction 
Coopersmith's (1967) contention that distinctions among specific domains are 
made by young children "within the context of the over-all, general 
appraisal of worthiness that children have already' made" (p. 6). 
Interestingly, interpretations by each researchier are based on the failure 
of factor analyses to identi-fy the intended sel-f-concept scales in their 
respective instruments, alb lugh the failure to support an a priori factor 
structure is a weak basis of support. Whereas Harter (1983; also see Marsh & 
Richards, 1908, Marsh & Smith (1982), Shavelson, et al., 1976) called into 
question the logical basis of Coopersmith's conclusion^ there does not 
appear to be ccmpelling support for either of these opposing perspectives. 

Speci-Fic factors. The Harter and Pike (1984) instrument was designed to 
measure four self-concept scales. Factor analyses, however, provided support 
for only two factors: competence (incorporating the physical and cognitive 
scales) and social acc©r**^nce (incorporating the peer and maternal scales). 
The authors noted that '-n factor structure is less di-Fferentiated than 
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assumption that the structure becomes oore di-Fferentiated with agso Based 
primarily on support -for this two-factor model, Harter and Pike suggested 
that young children do not di-Fferentiate among competencies in different 
areas although they do di-Fferentiate between general competence and general 
social acceptance, a^pport -for a similar two-Factor model was also -found in 
the Si Ion and Harter (1985) study of responses by children with 
chronological ages of 9-12 who had mental ages of less than 8. 

Harter and Pike's (1984) failure to support their a priori fcur--factar , 
structure provides a weak basis of inference about the structure of self^ 
concept, particularly when analyses were based on exploratory factor 
analyses rather than the methodologically stronger CFA that allows the 
researdier to specify the model to be tested (e.g.. Marsh & Hoceva'^, 1985). 
The failure to separate even the physical and academic components that are 
so robust in responses by slightly older students is surprising and requires 
-ftArther scrutiny. Correlations among the physical and academic scales in 
the Harter and Pike study varied from .43 to .56 and did not approach 1.0 
even after correction for unreliability. Furthermore, Harter and Pike (1984) 
noted that when the four scales were correlated with external criteria 
(e.g., teacher ratings, choice behavior, being held back a grade) there was 
support for the separation of the physical and academic scales. Also, other 
facets — particularly physical appearance — were not included that could, 
perhaps, be differentiated from other areas of self, are appropriate far 
very young children, and do not fit easily into the categories of either 
coapetence or social acceptance. Finally, even support far their conclusion 
that self-concept becomes more di Fferentiated with age was weak. They noted 
that their factor structure was less di-Fferentiated than th^t found with 
older children, but they did not administer their instrument to older 
children. More importantly, they did not offer any evidence that self- 
concept became more di-Ffsrentiated with age for the 4-7 age range that they 
considsred, suggesting instead that the factor structure was similar for 
their preschool /kindergarten sample and for their lst/2nd grade sample. In 
summary, whereas the Harter and Pike (1984) study is important, it may be 
prertature to conclude that whildren can only differentiate two broad 
ca*nponents of self. 

Global self-conceot. Harter and Pike (1984) did not include a global 
self scale on their instrument because *'both theory (see Harter, 1983) and 
empirical findings have led to the conclusion that children ere not capable 
of making Judgments about their worth as a person until approximately the 
^ age of 8" (Harter & Pike, 1984, p. 1970). In discussing the Harter and Pike 

ERJC resulto, Silon and Harter (1985) noted that •'below the age of 8, children 
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either do no^- understand the [general] sel-f-^rth 'I terns, produce extremely 
unreliable estimates, or both" (p. 227). However, neither the theoretical 
nor the empirical support -fior this conclusion is presented in sufficient 
detail in either of these studies to be evaluated adequately. 

In Harter and Pike (1984), as elsewhere (e.g., Harter, 1986; Silon & 
Harter, 1985), the reader is referred to Harter (1983) for the theoretical 
basis of this conclusion. Whereas Harter (1983) provided a rich theoretical 
framework for the development of self, she did not directly address the 
theoretical rationale for the assertion that general self-concept does not 
exist before the age of 8 and her general theoretical discussion did not 
provide a firm basis for it. For example, Harter (1983) noted that the 
capacity for limited hierarchical organization first appears during the 
concrete operational period (e.g., I'm smart because I'm good at reading, 
spelling, and mathematics), but she also noted that when general trait 
labels first become available, children tend to use an "all-^r-none" 
thinking. Whereas children are unable to integrate specific components of 
self to form a global self^concept at Stage I in Harter' s (1983) 
developmental schema, children at this stage think they are all good (or all 
bad) across a wide variety -yf domains. In the Harter schema, however, it is 
not until stage IV (middle adolescence) that children are capable of higher- 
order abstractions. , From this perspective is not clear why children aged 8- 
12 are able to have general self-concepts, though her empirical research 
indicates that they are. Eli^ewhere Harter (1983) noted that whereas very 
young children can experi nee emotions such as pride and shame, definitions 
of these emotions focused on others (Dad was proud of me because . . . ) at 
ages 5-7 and that self^references did not emerge until the age of 8. Whereas 
it is reasonable that very young children do not have the cognitive capacity 
to integrate specific components of self to form a general sense of self, 
this presupposes that general self follows rather than precedes self- 
concepts in specific dc3mains. If self-concepts in specific domains are 
derived from a general sense of self, then children may ^ot require this 
integrative capacity in order to experience an overall sense of self. On the 
basis of Harter's (1983) review it appears likely that the processes 
underlying the formation of general self vary systematically with age but it 
appears premature to conclude that very young children do not have a general 
self-oonoept. 

Harter and Pike (1984) provided even less discussion of the empirics , 
basis for their assumption. Silon and Harter (1985), referring to the Harter 
and Pike research, noted that the general self scale lacks reliability, but 
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no reliability estimates were presented -for the global scale In either 
study. SI Ion and Harter (1985) administered Harter's (1982) instrunenc 
designed -far older children to children ^jho had chf ofXDlogical ages of 9-12 
but mental ages less than 8. They reported thc.t items from the global scale 
did not emerge as a separate factor and did not load consistently on either 
of the two factors (general competence and acceptance) that did emerge. 
Responses by these older but intellectually disadvantaged children, however, 
may provide a dablous batais of generalization to responses by very young 
children. Furthermore, here again it must be noted that the failure to 
replicate a factor structure is a '>«ak basis of support for this contention. 
A much stronger test would have been possible had the researchers used CFA 
as in Marsh and Hocevar (1985) Instead of exploratory factor analysis. 

The Present Investiqatiori 
In the present investigation 501 young children in kindergarten, 1st, 
and 2nd grades responded to an individually presented version of the SDQI 
using a modified response format. Subsequently, about 2 weeks after the 
ind'ividjially administered SDQIs were collected, the typical group 
administered SCXJI was collected from 1st and 2nd grade children. The factor 
structure and responses for the group and indivldu'Jly adninistered versions 
were compared. The major aims of the study are: (a) to establish the 
psychometric properties of these responses and to determine whether the SDQI 
factor structure found with older children can be replicated; (b) to test 
Harter' s claim that general self-concept does not exist before the age of 8; 
(c) to test the Shavelson et al. (1976) hypothesis tnat oelf-concept becomes 
more differentiated with age and provide more specific data on how the 
factor structure of self-concept varies In the age range of 5 to 8 j (d) to 
compare indivldjial and group administered versions of the instrvirnent, and 
(e) to evaluate sex and age differences for these very young children. 

Methods 

S ubjects 

A total of 501 students from kindergarten (n=163), grade one (n=169), 
and grade 2 (n=169) participated In the study. Children in each of the three 
grade levels were predominantly 5 year., of age (kindergarten), 6 years of 
age (1st grade), and 7 years of age (2nd grade). The subjects came primarily 
from middle class families ahd attended one of three infant schools in 
suburban metropolitan Sydney, Australia. 
Instruments; The SDQI 

The SDQI (Marsh, 1988, in press-a) is is one of a sat of three 
instruments daslgnsd to measure multiple dimensions of self-ooncept -for 
ERXC" P»"Qadolesc8nts (8DQI)^ for early and m^^dle adolescents (8DQII), and for 
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late adolescents and yoing- adults (SDQIII). The instruments are ba&ed on the 
Shavelson et (1976) model of self-concept and the facets of self 
proposed in that model. More than 30 ptblished factor analyses have 
identified the factors that each instrument is design'r.d to measure. Other 
research has shown that: (a) the reliability of each scale is generally in 
the O.eOs and 0.90s whereas correlations among the factors are quite small 
(median rs less than 0.20), <b) the sel-f^ncept responses are substantially 
correlated with self-concepts in matching areas inferred by significant * 
others, <c) academic achievement indicators are substantially correlated 
with academic areas of self-concept but nearly uncorrelated or even 
negatively correlated with nonacademic areas of sel-f-concept and general 
sQlf-concept, <d) self-concept factors are systenaticaliy and logically 
related to a variety of other constructs iticluding age, gender, locus of 
control, self-attributions -For the causes of academic successes and 
failures, physical fitness and participation in sports, and interventions 
designed to enhance sel-f-concept. This research provides strong support for 
the construct validity of responses to the SDQ instruments for childrtan aged 
10 or older, and perhaps children as voung as 8. In her review of academic 
self-concept measurec, Byrne (in p sss) concluded: "it is apparent that the 
SDQ I, li and III, without qv'est.on, are the most rigorously validated 
instruments currently available to researchers; each is solidly linked to 
the hierarchical model of SC CseH^conceptD theory, and their separate ACS 
Cacademic self-concept] subscales have been shown to be psychometrically 
sound" <p. 29). 

The SDQI (Marsh, 1988) assesses three areas of academic self-concept 
(reading, mathematics, and school self-co.icept) , -fbur areas of non-academic 
self-concept (physical ability, physical appearance, peer, and parent- 
relations) and a general-self scale. Three total scores can also be -ntasured 
on the basis of these scales; academic sel-F-concept (the average of reading, 
mathematics, and school self-concept scales) , non-academic self-cona it (the 
average of physical, appearance, peer, and parent '/elations self-concept 
scales), and total self (the average of academic and non-acadamic total 
scales). For both the standard (group adninistered) and individually 
administered versions of the^SDQI each of the 8 SDQI scales i.as defined by 
responses to the same 8 positively worded items. On the standard SIDQI there 
are an additional 12 negatively worded items. Because previous research has 
shown that children have trouble responding appropriately to the negatively 
worded items (Marsh, 19S6a), they are not included in the scares derivf»d 
Pl^9^- ^ SDQI (Marsh, 1988). For purposes of the individually achiinistered 
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SDQI usad here, the negatively worded items were excluded altogether. As 
described below, the response scale typically used with the SDQI was also 
altered -for purposes of just the individually adninistered responses. 
Procedures 

Procedires -for the administration of the standard SDQ (see Marsh, 1988) 
were adjusted to enable the modified SDQ to be ad^ninicjtered by trained 
interviewers to subjects in an individual interview. The interviewers were 
120 college students in a primary teacher education program who already had* 
experience working with young children. Interviewers were from a large 
teacher education course in social studies that contained a component on 
self-concept research. Participation consisted of attending a preliminary 
training session and subsequently testing children from each of the three 
age groups. Training consisted of attending a two hour training session in 
which procedires for administering the instrument tMsre explained, a ten 
minute training video of a kindergarten child responding to all procedures 
was viewed, and a ten minute adnr.ini strati on practice session took place with 
another trainee interviewer responding to the questionnaire. Written su-wnary 
instructions of procedures discussed in the training session were 
distributed to interviewers. 

At each school approximately one-third of the interviewers 
simultaneously condicted interviews with all students from a particular 
class. Using this procedire, all pupils in kindergarten, 1st and 2nd grades 
were tested in each school. The testing was conducted individually and 
pupil J were interviewed in a location on the school grounds that was chosen 
to ensure responses from other children would not be overheard. 

Each testing session began with a brief set of instructions during which 
the subjects were assured of the confidentiality of their respor«es. As a 
component of these instructions, four example items were presented and 
discussed. Children wert ancouraged to indicate any difficulties tliey 
experienced in responding to an item. After reading each example item twice 
in rapid succession the interviewer asked the child if he/she understood the 
sentence. If the child did not understand the sentence the interviewer 
explained the sentence fin-ther, ascertained if the child understood the 
sentence, re-read the sentence, and requested a response. After ascertaining 
that the child understood thb example item, the intervi^r initially asked 
the child to respond "yes" or "no" to the sentence to indicate whether the 
sentence was true or false as a description of the child. If the child 
initially responded "yss" the interviewer then asked the child if he/she 
FR?r "^"^."y^ always" or "yes sometimes". If the child initially responded "no" 
t he intervie wer then asked the child i-Pj^g^s^ meant "no alwr.ys" or "no 
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sometimes". The second response probe was stated -for every response even 
i^jhen it w-i; answered in the initial response (e.g. the dnild said "yes 
always** ±ead of "yes"), thus providing a check on the accuracy of the 
child's initial response. After the child successfully responded to 
example items and any questions were answered, the interviewers then read 
aloud each of the 64 positively worded SDQ I items. Halfway through th« 
administration of the SDQI items the interviewer asked the child to do some 
physical activities for a brief period before proceeding to administer the * 
remaining 32 items. This procedure was included to cater for young children 
with short attention spans. After presenting each of the -First four items 
the interviewer asked tS^B child i-f he/she understood the sentence before 
obtaining a respoise. The child was subsequently .ncouraged to indicate any 
difficulties he/she experienced in responding ro the remainder of the items. 
This procedire was included to encourage children to seek clari-fication of 
any item they did not understand. If the child stated that the item was not 
understood the interviewer explained the meaning of the item further and 
ascertained the child understood the sentence before readninistering the 
item. Children were periodically asked if they understood subsequent items 
diring the remainder of the administration. Pilot work indicated that some 
kindergarten students had difficulty understanding a few of the items, and 
these items i^ere initially presented in their original form and then 
paraphrased to ensure that they were understood. Thus, for example, children 
were told that mathematics meant work with numbers. 

If a child did not initially respond to an item Dy stating yes or no, 
the interviewer explained the meaning of the sentence^ re-read the sentence 
and requested a response. If the child still did not respond appropriately 
the item was circled and re-read after the administration of remaining 
items. I-f the child still did not respond appropriately the child was asked 
if he/she understood the sentence. If the child did not understand the item, 
the item was further clarified by the interviewer. If the child indicated 
he/she understood the sentence but could not decide whether to respond yes 
or no, the interviewer recorded a response of 2.5, halfway tietween the 
responses of "no sometimes" and "yes sometimes. Because the occurred 
infrequently and children i^re not told of this option, this middle category 
was used very infrequently. 

Approxifrtatieiy 2 weeks a-fter the individually adninistered SDQIs were 
collected, the SMI was administered to nearly all the 2nd grade (n=158) 
children and a majority of the 1st grade (nplll) students using the typical 
group* administration procedures (Marsh, 1988) instead of individually 
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administered interviews. The group administration procedure was deemed to be 
inappropriate -Fd^ kindergarten children — even after crxnpletion of the 
individjalized administration procedure. Two classes of let grade students 
from one school were unable to participate in this second phase of the study 
for reasons unrelated to the purposes of the study. For purposes of the 
group administration, each child was given a copy of the SDQI. The 
researcher read the instructions aloud, clarified them, presented several 
example items, and then answered any questions before presenting the SDQI ^ 
items. The SDQI items an the questionnaire given to each child were then 
read aloud twice and children were asked to respond to the items on their 
questionnaire. For purposes of the group administration, the standard five- 
point response scale (false, mostly false, sometimes true/sometimes false, 
mostly true, true) was used. The group administration procedures used here 
and those presented in the manual differed in that children were asked to 
place a ruler under the item being read aloud to facilitate marking their 
response on the correct line. 
Statistical Analyses 

The statistical analyses consisted of an evaluation of the psychometric 
properties (reliability and factor structure) of the self-concept responses 
and of sex and age effects in the selfHzoncept ratings. Separate factor 
analyses were conducted on responses by each age group separately and for 
the total group, using the LISREL approach to CFA as in Marsh and Hocevar 
(1985). Multivariate and univariate A^JC^/A were used to test sex and age 
effects in the multiple dimensions of self^-concept. 

As part of the analyses, correlations among SDQI scores based on 
responses collected in the present investigation were compared with those 
based on responses in the normative archive of SDQI responses u ' students 
in grades 2-6 (Marsh, 1988; also see Marsh, 1989) . Most SDQI research has 
used factor scores that are routinely produced by the SDQI scoring program, 
based on factor score coefficients derived from a factor analysis of all 
responses in the normative archive (Marsh, 1988, 1989). Whereas the scoring 
program ccmputes both factor scores and simple scale scores based on an 
unweighted aver^ige of responses to the items designed to measure each scale, 
correlations among the SDQI scores are typically smaller for factor scores 
than for scale scores (Marsh', 1989). For purposes of the present 
investigation, correlations based on both sets of scores are considered. 
Because many of the responses in the normative archive are based on an 
earlier version of the SDQI that did not contain the General Self scale on 
O the current version and considered here, the General Self scale was not 

il£ included for purposes of Just these comparisons with the normative archive. 

1 5 - 
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Confirmatory factor analysis. As is typical in SDQ research (e.g., 
Marsh, 1988; Marsh & Hocevar, 1985) -factor analyses were conducted on item- 
pair scores (or parcels) in which the first two items in each scale are 
averaged to form the first item pair, the next two itnms are used to form 
the second pair, and so forth. Thus the 64 SDQI iterrts i>«rc reduced to 32 
item pairs that were used in subsequent analyses. Analysis of item pairs 
instead of individial items is advantageous because the item pairs tend to 
be more reliable, to be more normally distributed, and to have less * 
idiosyncratic variance than do individual items. Also, it is often 
recommended that there are at least 5 times as many subjects as variables in 
•factor analyses and this guideline was satisfied for s^arate analyses at 
each grade level by factor analyzing item pairs ir^tsad of items. 

In CFA, particularly when results from di-fferent samples are to be 
corpared, it is apprcpriate to analyze covariance matrices instead of 
cc^'-alation matrices. Parameter estimates based on covariance matrices, 
however, are not so easily interpreted as the standardized measures based on 
correlation matrices. An appropriate compromise (Joreskog & Sorbom, 1988) is 
to standardize measured variables across the total sample and then to 
construct covariance matrices for each of the subsamples to be considered 
separately. This approach was used in the present investigation. 

In CFA the researcher posits alternative a priori models to fit the 
dHta, compares the ability of the models to actually fit the data, and, 
perhaps, posits further a posteriori models if the a priori models do not 
adequately fit the data. For present purposes, three a priori models were 
•fit to the data from each year group separately and to the total sarrple 
across all three year groups: (a) a one--factor model in which all measured 
variables loaded onto a single general factor factor; (b) a two-factor model 
in which the variables from the three academic scales defined an academic 
-factor and the rest of the measured variables defined a non-academic factor; 
and (c) an 8-f actor model in which each factor corresponded to one of the 
SDQI scales. In these models, each measured variable was allowed to load on 
only the one factor that it was posited to reflect <an independent clusters 
model). Factor correlations and uniquenesses (residuals or specific 
variances for each measured variable) were estimated, but correlations among 
the uniquenesses were restrained to be zero. Support for the a priori factor 
structure of the SDQI responses is based on the per-formance of the 8-factor 
model rhe corresponds to the design of the instrument. 

An important unresolved issue is how to datermine whether the fit is 
ERXC sufficient to justify support of an a priori model. In general the approach 

16 



SeH=-concepts of Young Children 14 
is to evaluate the parameter estimates to d&tarmine whether they are 
cxDnsistent with predictions and to evaluate goodness of fit for alternative 
models. Researchers have developed a variety of goodness of fit indicators 
to aid in this process and those that appear to be among the most useful 
(see Bentler, in press; Marsh, Balla & McDonald, 1988; Marsh, McDonald & 
Balla, in press; McDonald & Marsh, in press) are considered here: 

1. chi--square goodness of fit statistic (X^) = n x FF; where N = sample 
size and FF is the maximum likelihood fitting function 

2. TuckerH^is Index (TLX) = TLX = (Xn^/df^ - Xt^/dft?/ (xV^^n ' 
Cl.O]); t^re df=dBgrees of freedom and the subscripts n and t refer to 
values obtained from a null mode} and the target model respectively, 

3. Unbiased Relative Fit Index <URFI) = (Dkn - Dkt) / (Dkn ~ 0); Note 
that this is an unbiased counter-part of the fit index originally proposed 
by Bentler and Bonett (1980; see Bentler, in press; McDonald & Marsh, in 
press) where Dk = FF - df/N. 

Better fitting models ha^/e lower chi--squares and higher TLIs and URFIs. 
AltlTough there are no clearly established rules as to what constitutes a 
"gdod" fit, a widely applied guideline for relative indices like the URFI 
and the TLI is .90 (e.g., Bentler & Bonett, 1980; Bentler, in press). An 
index of .90 can be roughly interpreted as being able to explain 90"/. of the 
covariation among the measured variables. The TLI and URFI dif^r primarily 
in that the TLI has' a penalty for model complexity whereas the URFI does 
not. If none of the a priori models is able to fit the data adequately, the 
researcher may propose additional a posteriori models to better fit the 
data. LISREL provides modification indices (see Joreskog & Sorbom, 1983) 
that estimate the change in chi-square due to adding additional estimated 
parameters to the model. For example, the modification indices may suggest 
that a particular variable should be allowed to load on more than one factor 
even though the a priori modal posited independent clusters. 

Preliminary results — internal consistencv estimates . Internal 
consistency estimates for the 8 individually adninistered scales (Table 1) 
are in the .70s and .80s for each year group and for the total sanple except 
for the Parents (.692) and Physical (.505) scales with kindergarten 
respondents (see Table 1). In general, these reliability estimates increase 
with age (median es^timates are .735, .797, and .819 for kindergarten, 1st- 
grade and 2nd grade students). For the three total scores, interestingly, 
the reliability estimates for the three age groups are more similar than are 
the estimates for individual scales, and those for 1st grade students are 
O slightly higher than those of 2nd graders. The internal consistencies of the 

y^iL General self scale (.726, .781, ,742, respectively) are moderate — thouoh 

^7 



Sel-F-oc-tcspts of Young Children IS 

belc3w the median reliabilities for each group - and show less age effects 
than do the averages of all scales. The internal consistencies for the group 
administered respond (administered to 1st and 2nd graders) show a similar 
pattern of results, though the size of the estimates is slightly higher. 
Overall, the internal consistency estimates provide reasonable support for 
the SDQI responses and indicate that responses to the General self scale are 
reasonably reliable for all three year groups. Whereas there are systematic 
age di-Fferences in the reliabilities of specific scales, age differences are* 
smaller and less systematic for the three total scores and the General scale. 

Results 

Factor Structure For the Individually A dministered SDGI Responses 

Three a priori models were considered that posit <a) one general 
factor; <b) an academic and nonacademic factor, and <c> 8 factors 
corresponding to the 8 SDQI scales. These models were fit to the data from 
each year group separately and to the total sample across all three year 
groups. Support for the design of the SDQI requires requires that the 8- 
fac±or model is able to fit the data reasonably well, that it fits the data 
substantially better than the 1- and 2-factor models, and that parameter 
estimates based on the 8-Factor model are reasonable. 

^iegdress of fit^ Inspection o" the URFIs (Table 2) indicates the 8- 
factor model fits the data substantially better than the 1-factor or 2- 
factor i^dels. These results are consistent -for the total group and -fbr each 
year group considered separately. For the 1-factor and 2-factor moctels, the 
fit for the 2nd grade data is poorer than for either the kindergarten or 1st 
grade data. For the 8-factor model, however, the fit for the 2nd grade data 
is better than those for the younger children. These results indicate that 
the a priori 8-fa^r model consistent with the design of the SDQI does 
substantially better than models positing -fewer factors, and that the 
advantage of the 8-factor model is larger for 2nd grade children. Using the 
£.90 guideline as a criterion of a "good" fit, the fit of the 8-factor a 
priori model is good for the total sample (.916), almost good -for the 2nd 
grade data (.887) and the 1st grade data (.869) grade data, and somewhat 
less than good fer the kindergarten data (.824). 

Insetrt Table 2 ftboat Here 
Inspection of LlSREL's modification indices (Joreskog & Sorbom, 1983) 
for the 8-factor model indicated that allowing measured variables to load on 
•Factors other than ths one they were designed to measure would not improve 
fit substantially, but that freeing some correlations among uniquenesses 
ERTC associated with each measured variable would improve the fit. For the total 
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sample and each of the separate samples, the oxDdif ication indices indicated 
that freeing correlated uniquenesses associated with the first two 
indicators of the Appearance factor and the first two indicators of the 
Reading factors would have a substantial effect* This suggests that the 
measured variables within each pair are more strongly correlated than can be 
explained by their relation to the coovnon factor that they are designed to 
measure. Because of this consistency across the different samples, an 8- 
factor a posteriori model was tested in which these additional parameters 
were freed. Whereas the inclusion of these two additional parameters 
significantly improved the fit far the total sanple and each subsample, the 
change in the URFIs was modest (Table 2). Because the conclusions based on 
the 8-factor a priori model differ little from those based on tlie 
corresponding a posteriori model, will focus on the a priori model in 
subsequent discussion. 

Parameter estimates > The evaluation of the factor loadings (see Table 
3) for the 8-Factor a priori models indicates that all 8 factors — for each 
age 'group considered separately and for the total Jsample — are well- 
defined; every factor loading is statistically significant and nearly all 
are substantial in size. The mean factor loading,, however, is larger -for 
older children. 

Insert Table 3 About here 
Particularly for the youngest children we were concerned that the SDQI 
was so long that the c^lity of responses might deteriorate for items near 
the end of the questionnaire. Because the items within each scal(=» are 
randomly ordered on the SDQI, inspection of the factor loadings from the 
first half of the SDQI with those from the second half provides a test of 
this possibility. Inspection of the factor loadings (Table 3), however, 
suggests that just the opposite occurred. The sizes of the factor loadings 
for the first two indicators are systematically lower — not higher — than 
those for the last two indicators, and the size of this difference is larger 
for the youngest children. These results suggest that — particularly -for 
the kindergarten children — there was a practice effect such that the 
initial responses were systematically less effective than those of 
subsequent responses but that there was no apparent deterioration in 
responses near the pnd of the SDQI. 

The size of factor correlations provide one indication of how well 
children are able to differentiate among the 8 factors. In evaluating the 
size of the factor correlations in CFA, it is important to note that these 
are correlations among latent constructs that have been corrected -for 
unreliability and are thus larger than correlations between sinple scale 
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scares 1 (see subsequent discussion of Tables 5 and 6), Nevertheless, the^ 
mean correlations among factors for both kindergarten (.686) and 1st 
grade (.658) samples are substantial, ^^rf^erea3 the mean correlation for the 
2nd grade sample (.478) is substantially smaller. Despite the difference in 
mean correlations for di-Fferent age groups^ there is a consistent pattern in 
the relative sizes of the correlations. For all three samples the highest 
correlations involve the General and School scales. Specifically, the 
General Self scale is consistently correlated most highly with Physical * 
Appearance, Peers, and the School scales, whereas the School scale is also 
highly correlated with the f?eading and Maths scales. 

The size of uniquenesses represents the specific variance and error 
variance in each measured variable that cannot be explained in terms of the 
8 a priori factors. Consistent wiLt results already discussed, the sizes of 
the uniquenesses decrBase systematically with -^ge (Table 3). 2 

Ibe barter Model. Harter and Pikw (1984; Si Ion & Harter, 1985) used 
exploratory factor analyses to test the construct validity of responses to 
their self-concept instrument. Instead of the 4 scales that the instrument 
was designed to measure, only 2 factors were identified; a competence factor 
incorporating the physical and academic scales and an acceptance factor 
incorporating the social and maternal scales. In results summarized in this 
section we sought to evaluate this two-factor model using CFA instead of 
exploratory factor analysis. 

The 4 scales the Harter and Pike instrument was designed to measure 
correspond most closely to the Peer Relationships, Parent Relationships, 
Physical Ability, and School scales from the SDQI.^ We fit responses to just 
these SDQI scales with three different models; a one-factor model in which 
all variables loaded on one (general) fdctor, a two-factor nadel like that 
proposed by Harter, and a fbur-f.:ictor model in which each of the 4 scales 
defined a separate factor. For cjach age group and fay the total sample, the 
Harter and Pike (two-factor) model performed marginally better than the one- 
factor model (see Table 4) but substantially poorer than the four-factor 
model. Consistent with earlier findings, the adv^antage of the four-factor 
model over the one- and two-factor models was positively related to the age 
of the children. These findings lead us to reject the two-factor model based 
on Harte:- and Pike '(1984) and to conclude that — even for young children — 
self-t:oncept is more differentiated than suggested by the Harter and Pike 
results. 

Insert Table 4 Abcj^Jit Here 
^^omparioon of th§ Individually and Group Administered SPqis 
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Students in just the 1st and 2nd grade samples cx>mpleted the standard 
group administered SMI approximately 2 t^jeeks after the individually 
administered SDQIs. In the evaluation of th^ group actninistered responses 
several features are important. First, the group administration procedure is 
typically not recommended for children as young as these (Marsh, 1988), has 
only been used with 2nd grade students as part of one study (Marsh, Barnes, 
Cairns & Tidman, 1984), and has never been used with 1st grade children. 
Secondly, it is very likely that young children who — as in this study — ^ 
have recently completed the individually adninistered SDQI will be better 
able to cope with the group adf.unlstered SDQI than similar children who have 
not been previously exposed to the SDQI. Hence, an evaluation of the SDQI 
responses based on the group administration procedure in this study are 
unlikely to generalize to other samples. Given these considerations, several 
issues are relevant: 

1. To the extent that children of a particular age are unable to cope 
with the group administered SDQI in the present investigation, other children 
of a similar age are even less likely to be able to do so in other studies :ii 
which the individually administered SDQI has not already bef \ completed. 

2. To the extent that the 8-factor model considered earlier is able to 
fit individially administered data better than the group adninistered data - 
- despite the likely advantage of group actninistered SDQI doe to prior 
exposure to the individually administered SDQI — then there is strong 
support for the superiority of the individual ackninistration. 

3. To the extent that children respond appropriately to both the 
indivicLially administered and group administered SDQIs, then the coaparison 
of these responses provide useful information about the short-term stability 
of self-concept responses. 

The most ioportant comparison o-f the group-adninistered and 
individually-administered responses is the goodness of fits for the various 
models. For the group-administered responses, as with the individually 
administered responses, the 8--factor model is able to fit the data 
substantially better than the 1- and 2-f actor models both for the separate 
groups and the total sample (Table 2). There are, however, important 
differences in the comparison of fits for the lt.c and 2nd grade responses. 
Overall, fits are tpetter for 'the individually adhninistered data than for the 
group-administered data. Also, the differences between fits for 1st and 2nd 
grade data are larger for the group-administered data. For the individually- 
administered data, the 8-factor a priori radel was able to fit the 1st grade 
^ data nearly as well as the 2nd grade data. In contrast, for the group- 
ERJC administered data, the fit for the 1st grade sample is substantially poorer 
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than -for the 2nd grade data. 

Parameter estimates -For the individua^y- and group-adninistered 
responses are not directly cooparable becaui's of the di-fferent response 
scales. Several observations about the group-adninistered results (Table 3), 
however, are informative. The factor loadings, for all 8 scales are 
statistically significant and substantial for both age groups and for the 
total sample. Factor correlations are very large (,682) fb'/- the 1st grade 
responses and substantially smaller far the 2nd grade sample (.425;. The 
dif^ference in mean correlations between the two year groups is also larger 
tf»an obsen/ed far the iiidividxally ad'ninistered response. (.671 and .478). 
The uniqMenesses are also substantially larger for the 1st grade responses 
than fbr the second grade responses. 

In summary, the 8-factor a priori mocfel fits the individually 
administered responses better than the group adninistered responses. This 
di-Fference is modest for ths 2nd grade data, but more substantial fo. the 
1st. grade data. Because the design of the study was biased in favor of the 
group-administered responses — since they came after the individually 
administered responses — we interpret the results as demonstrating the 
superiority of the individxally administered responses for both 1st and 2nd 
graders, but particularly for the 1st graders. Although the group 
administration procedure '^s not used with kindergarten students because it 
was deemed to be inappropriate, the advantages oF the individual 
administrations can be assumed to be even larger for this age group. 
Correlations Between Individually- and Grouo-Adm i nist< ^^ red Scales 

Results presented above suggest that the individual acininistration 
proceciire is apparently effective with all three age groups whereas the 
group administration procedure is effr^ctive with 2nd graders and, perhaps to 
a lesser extent, 1st graders. Support for the group ackninistration 
procedure, however, m^y not generalize to other studies in which this 
procedure does not follow the individual adninistration procedure. 
Nevertheless, at least for data in the present investigation, it is usefUl 
to examine correlations between scales derived from the two procedures. I-f 
both administration procedures ware eqa'iUy effective at ir>rerring true 
sel-F-conoepts, then correlations between the two sets of scores i^ould 
represent short-term stability. Because the two adfiiinistration procedures 
are apparently not equally effective, tha correlations reflect a corbination 
of agreement between the two procedures and short term stability. 

Scale scores for the two administration procedures were computed by 
O „ taking the unweighted average of responses to the 8 items designed to 
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meaBure each scale. Correlations (Table 5) among the 16 scales — the 8 SDQI 
scales from each administration procedure — are presented separately -for 
Ist-graders (abo/s the main diagonal) and 2nd-gKaders (below the main 
diagonal). Each of these correlation matrices is a MTMM matrix in which the 
SDQI scales are the multiple traits, the two administration procedures are 
the multiple methods, and correlations between matching scales -From the two 
administration procedures (those in < » are convergent validities.^ In 
evaluating MTMM matrices (e.g., Campbell & Fiske, 1957; Marsh, 1989), it is^ 
typical to consider convergent validity and discriminant validity 
separately. Convergent validity refers to the agreement between multiple 
/nethods of assessing the same trait and discriminant validity refers to the 
extent to which the traits are distinguishable. 

Insert Table 5 About Here 

Convergent validity. All the convergent validities for both year groups 
are statistically significant and most are substantial in size. The mean 
convergent va'tidity is, howe>/er, substantially larger for the 2nd grade 
responses (.50) than for the 1st grade responses (.38). Correcting the 
convergent validities for unreliability (see Table 1) substantially 
increased the size of the coefficients, but did not reduce the difference in 
convergent validities for the 1st and 2nd grade responses (means of .47 and 
£.62, respectively). 

Discriminant validity. Discriminant validity is typically assessed by 
comparing convergent validities (hcmotrait-heteromethod correlations) with 
correlations between different traits assessed by different methods 
(heterotrait--heteromethod coefficients) and with correlations between 
different traits assessed by the same methods (heterotrait-monometh' ^ 
coefficients). Applying these two criteria: 

1. For the 2nd grade data, convergent validities (mean = .50) are 
higher than heterotrait-heteromethod coefficients (me:..! = .23) for 9954 of 
the 112 comparisons. For the 1st grade data, convergent validities (mean = 
0.38) are^higher than hetGrotrait--heteromethod coefficients (mean = .21) for 
9iy. of the> 112 comparisons. 

2. For the 2nd grade data, convergent validities {mean = .50) are 
higher than heterotrait-homomethod coefficients (mean = .42) for .6954 of the 
112 comparisons. For the 1st grade data, convergent validities (mean = 0.38) 
are higher than heterotrait-homomethod coefficients (mean = .52) for only 
2854 of the 112 comparisons. For both age groups, heterotrait-homomethod 
coefficients for the group administered scales are higher and resulted in 
more yiolatior^ of this criterion than did the those based on the 
individually administered scales. 
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The application cDf the traditional Campbell-Fiske criteria provide 
clear support -for convergent validity and good support -for at least one 
aspect of discriminant validity. Support for both convergent and 
discriminant validity was substantially stronger fop the 2nd grade data than 
■For the 1st grade data. There is, however, also evidence of a su.bstantial 
method effect associated with each of the adninistration methods. This 
apparent method effect is larger for the 1st grade responses than the 2nd 
grade responses, and is larger for the group-adninistered responses than for 
the individually-administered responses. 

Correlations ftugng SDQI Scores; A Comparison UJith the Normative Archive 
The SDQI itemn used in both the individually and group administered 
procedires are the same as those used for students in grades 2 to 6 that 
comprise the normative archive for the SDQI test manual (Marsh, 1988; also 
see Marsh, 1989). Thus it is meaningful to compare correlations among SDQI 
scales in the present investigation with correlations based on the normative 
archive. Most SDQI research is based on factor scores instead of sim|)le 
scale scores in part because correlations among the scores are consistently 
lower -For factor scores (Marsh, 1988; Marsh, 1989; also see -footnote 1). 
Comparisons based on both scale and factor scores are considered here. Far 
present purposes, the mean of 28 correlations among 7 SDQI scores — all but 
Geneiral Self — and the mean of 7 factors selected a priori by Marsh (1989) 
to be among the lowest correlations were computed for each grade level 
(Table 6) C^5ee Marsh, 1989, for further discussion of the rationale of these 
analyses ard for findings based on high school student responses to the 
SDQI I and late adolescent responses to the SDQI 1 1 

Insert Table 6 A bout Here 
Correlations among factor scores are consistently lower than 
correlations among scale scores, but the pattern of results is very 
consistent for both factor and scale scares. Results from the normative 
archive data (see Table 6) indicate that the mean correlations decline 
consistently with age at least through 5th grade and then appear to level 
out (also see Marsh, 1989). Also, the difference between the mean of 
correlations posited to be lower and the mean of all correlations is smaller 
for the youngest respondents. Marsh (1989) interpreted these results based on 
the normative archive to indicate that responses to the SDQI scales become 
more dif^rentiated with age at least daring the 2nd to 5th grade period. 

A similar pattern of results is observed for the individually and group 
adfoiniste^ed SDQI responses in the present investigation. Far the 
indivlciially administered responses, correlations among the SDQI ^reo for 
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2nd grade students are substantially smaller than UyosB among Is^ grade and 
kindergarten students • Because the 2nd grade responses are also more 
reliable, these differences would be even greater if the correlations were 
corrected for unreliability. Whereas correlations among 1st grade and 
kindergarten students are similar, the 1st grade responses ere more reliable 
BO that correlations corrected for unreliability are somewhat smaller for 1st 
grade students than for kindergarten students. This genera;, pattern is also 
seen in responses to the group administered responses, though the difference 
between correlations based on 1st and 2nd grade responses is somewhat larger 
for the group administered responses than for tiie individually adkninistered 
responses. Also, the correlations among 2nd grade responses are nearly the 
same for both group and individually administered responses, whereas 
correlations among 1st grad« responses are somewhat larger for group 
administered responses than for individually adhiinistered responses. 

Because both the normative archive and the present investigation include 
responses by 2nd grade students, it is interesting to compare the 
correlations based on these responses. The correlations among 2nd grade 
responses in the present investigation are substantially smaller than those 
in the normative archive. In fact, correlations based on 2nd grade 
responses in the present ir<vestigation are very similar to those based on 
3rd grade responses in the normative archive. Although there are alternative 
interpretations of 'these findings, they suggest that 2nd grade students in 
the present investigation ere better able to differentiate among the SDQI 
scales than 2nd grade students in the normative archive. 1«Jb interpret this 
to mean that young children are apparently better able to cops ^ath this the 
individually administered procedure than with the standard group 
administration pror^dju-es. The better differentiation based on the group 
administered responses in the present investigation apparently reflects the 
facilitative effect of already having completed the the individually 
administered task using the same SDQI items. These comparisons support 
earlier interpretations and suggest the superiority of the individually 
administered responses. 
Sex and Age Effects 

Although not a central focus of the present investigation, responses by 
children in the present investigation provide an opportunity to evaluate sex 
and age effects in self-concept far children who are youngor than those 
typically considered. SDQI research (see Marsh, 1988, 1989) with slightly 
older children has consistently found that mean ^^esponses for most SDQI 
^ scales decline with age — Relations with Parents oeing a possible 

ERXC exception. In thi© pfr'ievious research there has also been a consistent 

— ^- ^ 
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pattern of counterbalancing sex differences \Jhat is apparently consistent 
with fejex stereotypes. Coring preadolescent years the largest sex differences 
were for Physical Ability (fcAvoring boys) and Reading (favoring girls/. The 
observed sex differences were reasonably r . ^j/^ctent across the early 
preadolescent to young adulthood period, -^ith the apparent exception of 
Physical Appearance self^-conoept. In second grade yir'^s had higher self- 
concepts of Physical Attractiveress than boys, but at older ages — 
particularly cbring high school years — girls had substantially lower self-* 
concepts of Physical Appearance. These previous results provioa a general 
basis of comparison for findings in the present investigation. 

For purposes rtf the prssent investigation, a repeated measures ^WJOVA 
was used to assess the effects of age (kindergarten, 1st and 2nd grades) and 
sex across the 8 SOQI scales measured with the individually adninistered 
responses (see Marsh, 1989, for a more detailed overview of the analyses 
with older children). For total self-concept averaged across all 8 scales 
(i.e., tl'ie main effect of the repeated measure variable) the effects of sex, 
age, and their interaction were all nonsignificant. In each case, however, 
the sizes of these effects varied significantly depending on the SDQI scale. 
Tests of simple ,na.ln effects (SPSS, 1986) were used to assess the effect of 
each scale (Table 7). 

Insert Table 7 About Here 
Age was significantly related to three SDQI scales: Physical 
Appearance, Peer Relations, and School. In each case only the linear effect 
of age was significant and the direction of the effects was negative. The 
effect of sex was statistically significant for three SDQI scales: Physical 
Ability, Physical Appear^*nce, and Reading. Girls had substantially lower 
self-concepts of Physical Ability, and modestly higher self-/ >cepts in 
Physical Appearance and Reading. There was also an age by sex interaction 
for Physical Ability. Whereas boys had higher self-«)ncepts at all three 
ages, the sizes of the sex differences increased with age. The effects of 
sex age were also assessed for each of the three SDQI total scores. The 
only effect to reach statictical significance, however, was the negative 
effect of age on the total nonacademic score. 

In summary, the effectq-of s«x and age W6:^e generally modest. Except for 
the larga sex di-fference in sel-F-c-ancept of Physical Ability, none of the 
effects of sex, age, nor their interaction accounted for more than 2*/. of the 
variance in any of the 3031 scores. The direction of statistically 
^ significant effects — and ev=sf) those that approached significance — • were, 
ERJC fiowevSr, similar tX3 those fcxind in other SDQI research with slightly older 
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children. In this respect, the consistent pattern of sex and age effects 
found here adds -Rirther support for the individually adninistsred SDQI 
respcnses. 

Discussion 

The purposes of the present investigation were to evaluate a new, 
adaptive procecbre for assessing multiple dimensions of self-concept for 
very young children and to seek answers to three theoretical cpjtestions. The 
central finding of the study is the clear support for the use of the 
individually administered SiDQI far very young children. Due in part to the 
psychometric support for this new assessment procedure, the study was able 
-to provide answers to three theoretical questions: <a> each of the 8 SDQI 
factors identified in responses by older children were identified here, 
indicating that sel-F-concepts factor are better defined and more distinct 
for very young children than was previously assumed; <b) the general self- 
scale is apparently well-defined at each of the ages considered here, 
casting doubt on the suggestion that general self-concept does not evolve 
before 8 years of age; and <c) consistent with the Shavelson et al. model, 
the multiple dimensions of self-concept did appear to become more 
differentiated with age for these very young children. Each of these 
conclusions, however, warrants further consideration. 

Very young children are much better able to differentiate among 
multiple dimensions- of sel-F-ooncept than previously assumed. Responses by 
older children to appropriately constructed self-concept instruments <e.g., 
Harter, 1982; Marsh, 1988) are well differentiated in that factor analyses 
have identified the factors that su(±. instruments are designed to measure. 
Harter and Pike <1984; also see Silon and Harter, 1985), however, concluded 
that very young children are only able to di-Fferentiate between two facets 
of self-concept representing general con^tence and social acceptance. In 
contrast to the Harter and Pike conclusions, the 8 a priori SDQI factors 
found in responses by older children were identified for children aged 5 to 
7. Even when we limited consideration to just the four sel-f-concept scales 
most like those considered by Harter and Pike, there was clear evidence for 
the superiority of a four-factor model over the two-factor model proposed by 
Harter and Pike. The critical di-fferences are apparently that we considered 
a wider variety of eelf-concfept domains than did Harter and Pike and we 
employed CFA which is a substantially stronger analytic technique than the 
exploratory factor analysis procedures that they used. At least the second 
part of this suggestion could be tested by reanalyzing the Harter and Pike 
<19B4) data using CFA instead of exploratory factor analyses. 
ERjC General self-concept is reasonably well defined -For the age range 
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cxDnsidered in the present investigation. Harter (1983; Harter & Pike, 1984; 
ISilon & Harter, 1985) reported that general self-cxincept responses did not 
€:nierge as a separate -Factor, were not reliable, and were not systematically 
related to the -Factors that were identi-Fied. In the present investigation 
our General sel-F scale was reasonably reliable for each age group. CFA of 
responses for at each age group identified the feneral self factor and the 
factor loadings were substantial • Furthermore, the correlations among the 
General e'irale and the other scales were consistently among the largest of * 
any or the ODrrelations. Our conclusions clearly conflict with those of 
Harter, but sa little information about the nature of their general self 
'items and their analyses in the pilot study conducted by Harter and Pike is 
available that it is uif^^icult to determine critical differences between the 
two studies. 

The general selfHioncept scale ?n the Si Ion and Harter (1985) study was 
similar in design and — based on respor;ses by older children — was 
substantially correlated with the General SB?f scale on the SDQI (Marsh & 
Gouvernet, 1989; Marsh & McDonald-Holmes, in press). The Silon and Harter 
study, however, was based on responses by educable mentally retarded 
children aged 9-12 who had mental ages of less than 8, apparently used the 
standard group administration procecLires instead of individually 
administered instruments used in Harter and Pike, employed exploratory 
factor analyses instead of CFA, and did not actually report reliability 
estimates for their General scale nor correlations between this scale and 
other scales. A reanalysis of the Silon and Harter data that focused on 
their General scale and used CFA instead of exploratory factor analysis may 
resolve some of the apparent conflicts. Nevertheless, generalizations based 
on older, retarded children using a different instrument and different 
administration procedires to results based on younger, normal children 
should be interpreted cautiously. 

ajpport for the existence of a reasonably well-defined General self- 
concept for very young children has important theoretical implications. Two 
apparently quite different models of the evolvement of self-concept have 
been proposed. Coopersmith (1967) and others have proposed that specific 
facets of self-concept evolvp from a global sense of self. Harter (1983, 
1986; Harter & Pike, 1984), however, claimed that general self-concept does 
not exist before the age of 8 and that this general self requires the young 
child to integrate the very concrete sslf-perceptions that young children 
have of themselves. CXir results suggest that Harter' s proposal may be wrong. 
Whereas Harter may be correct in her assumption that very ypung children do 
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not have the cognitive capacity to integrate systematically the sel-f- 
concepts in speci-Fic domains, this may not be way they -form a general sel-F- 
concept. We -Find the existence of a general sel-F-concept to be consistent 
with the "all-or-none" thinking that Harter (1983) has identified in very 
•,=ung children that apparently is net based on a systematic integration of 
domain-specific information. Relevant to this suggestion is the observation 
(Table 4) that the short-term stability of the General self scores are the 
lowest of all the SDQI scores for the 2nd grade students =^nd particular for ♦ 
the 1st grade students. This suggests, perhaps, that the basis of General 
Self concept is more ephemeral than would be the case if it represented an 
integrated average of specific domains. It should be noted, however, that 
research with adolescents and even young adults (e.g.. Marsh, in press-a; 
Marsh, Richards & Barnes, 1986a, 19S6b) has also found General Self to be 
less stable than domain specific facets of self-concept. 

The lack of support for the Barter's proposal may — by implication — 
support Coopersmith's alternative proposal. We feel, however, that this 
interpretation may be premature. Even though both domain specific and 
general self-concept factors were identified here, there is no basis for 
concluding that one preceded the other. Consistent with the Shavelson et al. 
model, a reciprocal pattern of relations in which General sel-F-concept both 
affects and is affected by content-specific domains of self is also 
possible. Furthermore, the reasonably well-defined factor structure 
underlying SDQI resporkes is clearly inconsistent with the empirical basis 
of Coopersmith's proposal. 

General self-conc^t for the very young children considered here was 
reasonably reliable at any particular time but was less stable over time 
than content specific dimensions of self. If General self were based on a 
systematic integration of content specific domains, however, it should 
logically be more stable. From this perspective it may not make sense to 
argue that distinctions among specific domains are made in relation to a 
pre-existing global sense of self. Instead, it appears that General self 
concept for very young children ages may reflect an unsystematic integration 
of specific domains of self concept that is easily swayed by mood or events 
of momentary salience. Furthermore, responses by older respondents to 
general self-scales, may — to' a lesser extent — also reflect such 
tendencies. For example. Marsh, Richards, and Barnes (1986a, 1986b> reported 
that responses by late-adolescents and young adults to the General self 
scale were the laast stable over time of any of the 13 SDQI II scales even 
<j though it was one of the most reliable at each time considered separately. 

Schwarz, Strack, Kommer, and Wagner described a particularly 
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relevant model of cognitive processing to explain why global Judgments of 
subjec±iv& vgell being by adilts were less stable than corresponding domain- 
specific judgments. According to their model, accurate global evaluations 
would "require time-consuming information processing, involving a systematic 
consideration of many aspects of one's life as well as a multitude of 
comparisons, and an integration of their ioplications into a single 
composite judgment" (p, 70). Because of the complexity of this task, they 
suggested that mood at the time is used as a judgmental short-cut or * 
heuristic device for inferring subjective well being, and may also affect 
the availability of information in specific domains. In a test of their 
model, they foind that minor events that impacted subject's mood state had 
more impact on global judgments than on domain-specific judgments. Although 
their focus was on global judgments of subjective well being, Schwarz 
(personal communication) found a similar — but weaker — pattern of results 
for Rosenberg-like measures of esteem. Thus, whereas adults apparently have 
the. cognitive capabilities to more fully integrate domain-specific 
information in making global judgments, they apparently do not do so in most 
situations. In this respect, the cognitive basis of global judgments by 
adults may resemble those by young children found here. 

Results of the present investigation support the Shavelson et al. 
(1976) hypothesis that self-concept becomes more differentiated with age. 
First, the average correlation among the SDQI factors L -omes smaller with 
increasing age. Second, the difference in fits of models positing 1, 2, and 
8 factors (or 1, 2 and 4 factors in tests of the Harter and Pike model) 
become larger with age. Finally, internal consistency estimates for the 
three SDQI total scores — total academic, total non-academic, and total 
self — do not vary substantially with age even though the 8 specific SDQI 
scales are substantially more reliable for older children. Also, comparisons 
with SDQI responses from the normative archive suggest that the individually 
administered, adaptive procedures may facilitate the differentiation of 
self-concept facets by very ycung children. Stipek and Maclver (1989) 
suggested that the failure of previous research to demonstrate the ability 
of very young children to differentiate among self-concept facets may be an 
artifact of difficulties introduced by existing self-concept measures, and 
C3ur results support this suggestion. However, the support -for the increasing 
di-fferentiation of seHHaancept responses with age found here may also 
reflect di-fferences in the ability of children to cope with even the 
indivicbally administered SDQI. Whereas «iis possibility is always a viable 
crnn alterrtative, it would apparently not be consistent with the finding that 
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responses by the youngest children were nearly as reliable as those of older 
children for the total scales and for the General self scale. Hence we 
cautiously interpret the results as offering support for the hypothesis that 
self-^ncept becomes more differentiated with age and that the identification 
of this differentiation is facilitated by the individually adninistered SDQI. 

Factor analytic results presented here inply that very young children 
are cognitively able to differentiate among the content domains represented 
on the SDQI. This adds to a general trend demonstrating that children can ^ 
perform a variety of cognitive tasks at younger ages than typically 
hypothesized. Whereas it is possible that the factors are merely statistical 
abstractions with no concrete reality or reflect response biases 
idiosyncratic to each scale, such counter-explanations seem implausible. In 
addition to the statistical support presented here, the spontaneous 
verbalizations of children in the interview setting suggested that they 
understood the difference between such content domains as physical 
appearance and school ability. However, alternative tests of this conclusion 
based on a different task (e.g., a sorting task based on the content of the 
items) could be used to test counter--explanations of our findings and, 
perhaps, to provide a developmental analysis of the processes through which 
^l"F~concepts are formulated by very young children. 

The Harter and Pike (1984) instrument was apparently the best available 
instrument for measuring multiple dimensions of self-concepts -for very young 
children, but the results for the present investigation suggest that the 
psychometric properties of the individually administered SDQI are stronger. 
Because the construction of appropriate self-report instruments for young 
children is a pervasive problem, it is useful to speculate about why these 
differences exist. Harter and Pike (1984) presented children with parallel 
sets of (orally presented) verbal statements and pictures, whereas we used 
only verbal statements. It is plausible that the pictures would facilitate 
the task as suggested by Harter and Pike, but the need to process parallel 
stimulus inputs may have complicated the task. The fbur--point response 
scale consisting two dichotomous choices used by Harter and Pike (1984) is 
similar to the one used here, and so this is an unlikely basis of the 
difference. Both studies administered the materials individually, but the 
procedires used by J^arter arid Pike wsre not presented in sufficient detail 
to cofipare them with the procedures used here. Our procedures provided 
considerable opportunity for the administrators to clarify — if necessary - 
- interpretatior^ of the test items and the responses, but this may not have 
ERXC ^ ^ Harter and Pike study. Because we measured twice as 

^^^^^^ ^ ^r^^ f !?^ Pi?<e» ouy^ Instrument was considerably longer (64 
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items vs. 24 itenis). Whereas it is plausible that the shorter instrument 
would be more effective, our results showed an apparent practice effect such 
that items from the last half of tfje SDQI were more effective than items 
from the first half that was larger for younger children. Hence, the brevity 
of the Harter and Pike instrument may havt been a weakness instead of a 
strength. Harter and Pike (1984) developed a new instrument specially for 
very young children whereas we adapted the existing SDQI for this purpose. 
Because the Harter and Pike instrument and its four a priori factors have * 
not been validated with children at any age, the failure to support the a 
priori structure with very young children may reflect problems idiosyncratic 
to the instrument rather than general developmental trends. Also, 
comparisons between responses to their instrument and responses to different 
instruments by older children may differ because of the age of the children 
or the differences in the instruments. In contrast, the SDQI used here is 
well-validated with responses by slightly older children and the availability 
of •this research facilitates the comparison of responses by very young 
children and by older children. Finally, the CFA used here was stronger than 
the exploratory factor analysis used by Harter and Pike, and we suspect that 
a reanalysis of their data with CFA would provide stronger support for their 
instrument as well as our conclusions about the factor structure of self in 
very young children. A detailed evaluation of these differences would require 
that both instruments were used in the same study and, perhaps, a systematic 
manipulation of differences in instrument construction. Hence, firm 
conclusions about why these differences exist are beyond the scope of the 
present investigation but warrant further consideration. 

In summary, results of the present investigation provide support for a 
new procecbre far assessing multiple dimensions of self^ncept with very 
young children. EXie in part to the success of this new procedure, we t^re 
able to provide new evidence far important issues related to the development 
of sel-F-concept in very young children. In particular, the results show that 
sel-F-concept is much better differentiated by very young children than has 
been previously assumed and that these children do have a global self- 
concept. The development of an improved procedure of assessing self-^ncept 
for very young children also. has important theoretical and practical 
implications beyond those specifically considered here. The considerable 
advances in self-concept theory and practice for older children in the last 
decade was apparently based in part on advances in the ability to measure 
^ appropriately mi 'tiple dimensions of self-concept and the same may occur for 

ERJC research for vei / young children. 
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FcxDtnotes 

1 — (Consistent with other research, studies based on all three SDQ 
instruments f5how that correlations among -Factors derived by CFA (e.g., Table 
2> are larger than corrvSlations among simple unweighted scale scores (e.g., 
Tables 5 and 6), which are higher than correlations among -Factor scores 
(e.g.. Table 6), which in turn are higher than -Factor pattern correlations 
(not considered here) based on exploratory -Factor analyses such as those 
conducted with SPSSx (1986). Whereas a technical discussion of the basis for 
these di-Fferences is beyond the scope of the present investigation, it is ♦ 
important to note that the pattern of correlations is typically very similar 
•For all the various sets of correlations. 

2 a^bsequent tests of the invariance of parameter estimates across the 
three age groups similar to those described by Marsh and Hocevar (1985) 
indicated that there were significant differences between the groups. In 
order to recbce the complexity of the materials presented and because the 
nature of these di-Fferences are discussed in relation to parameter estimates 
presented for each group separately (see Tables 3 and 4), these subsequent 
tests of factorial invariance are not presented. 

3 This inference was based in part on studies (Marsh & Gouvernet, 1989; 
Marsh & MacDonald-ftolmes, in press) that specifically compared the content 
of scales from the SDQI and the Harter (1982) instruments, and correlated 
responses from the two instruments using multitrait-multimethod analyses. 

4 The traditional MTMM term convergent validity is retained even though 
these correlations might be interpreted to reflect consistency or stability 
instead of validity. As noted by Marsh (1989) in his review of this analytic 
approach, MTMM analyses are appropriate when the different methods are very 
similar or very dissimilar. 
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Table 1 



C3oefficient Alpha Estimates of Reliability for Each Grade Level and the 



Scale 



Physical 
Appearance 
Peers 
parents 
Read 
Math 
School 
General 

Total Scores 
Non-Acadsmic 
Academic 
Total 



Individual (Ind) 


and Group (Grp) Administrati 


ons 




Kindergarten 


1st Grade 


2nd Grade 


Total Sample 


Ind Grp 


Ind 


Grp 



Ind 


Grp 


Ind 


Grp 


.505 ... 


.710 


.782 


.730 




.745 


.668 




.764 


.744 


.830 


.832 


.861 


.797 


.826 


.814 


.770 ... 


.753 


.809 


.807 


.842 


.786 


.828 


.692 ... 


.726 


.811 


.765 


.837 


.722 


.825 


.757 


.841 


.866 


.837 


.827 


.820 


.841 


.773 


.833 


.846 


.853 


.866 


.823 


.856 


.724 ... 


,312 


.786 


.831 


.868 


.796 


.839 


.726 ... 


.781 


.818 


.742 


.782 


.749 


o799 


• 845 • • • 


.885 


.916 


.879 


.886 


.870 


.902 


.690 ... 


.910 


.917 


.902 


.920 


.903 


.918 


• ^729 • • • 


.947 


.956 


.939 


.944 


.939 


.950 



Notes. The individually administered SDQIs were obtained from kindergarten, 
1st and 2nd grade students, whereas the group adninistered SDQI were 
obtained from only 1st and 2nd grp.ab stxidents. Coefficient alpha estimates 
of reliability depend on the mean correlation among items and the number of 
items. Hence, the total scores — which are based on more items than the 
scale scores — have higher reliabilities even though the mean of 
correlations among items tends to be smaller. 
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Table 2 

Goodness of Fit Indices far Alternative Models for Kindergarten <K>, 1ST 
Grade 2nd Grade, and the Total Sample (TOT) For Responses to the 
Individually and Group Administered Responses 



Individi'al Group 





df 


Chi-Sq 


TLI 


URFI 


Chi-Sq 


TLI 


URFI 


Null 


a 

Models 














K 


496 


2399.59 


0 


0 


" " " ■" ' " 




— — 


1ST 


496 


2976.20 


0 


0 


2743.26 


0 


0 


2ND 


496 


3066.68 


0 


0 


3320,97 


0 


0 


TOT 


496 


6746.06 


0 


0 


5159.94 


0 


0 


1 Factor Models 












K 


464 


1058.75 


.666 


.688 








1ST 


46^/ 


1191.28 


.697 


.707 


1397.29 


.556 


.585 


2ND 


464 


1595.39 


.533 


.563 


1770.20 


.506 


.538 


TOT 


464 


2270.48 


.691 


.711 


:::352.66 


.567 


.595 


2 Factor Models 












K 


463 


947.68 


.727 


.745 




■ 





1ST 


463 


1109.76 


.721 


.7^9 


1299.41 


.601 


.628 


2ND 


463 


1463.84 


.586 


.614 


1478.79 


.615 


.640 


TOT 


463 


1942.99 


~y ft 1 

.746 


.763 


1948. 10 


.659 


.682 


8 Factor Models 














<a priori) 














K 


436 


771,30 


.800 


.824 








1ST 


436 


760.33 


.851 


.869 


1015.20 


.707 


.742 


2ND 


436 


729.38 


.871 


.887 


884.58 


.819 


.341 


TOT 


436 


961.49 


.904 


.916 


1166.41 


.822 


.843 


8 Factor Models 














<a posteriori) 














K 


434 


735.64 


.819 


.842 








1ST 


434 


721.18 


.868 


.884 


957.46 


.734 


.767 


2ND 


434 


695,64 


.885 


.899 


866.85 


.825 


.847 


TOT 


434 


826.85 


.928 


.937 


1094.73 


.838 


.858 



Note.^ Chi-Sq = chi-square, TLI = Tucker-i.ewis Index, URFI - unbiased 
relative -fit index. Because the same model was -fit to both indivicxtally and 
group administered data, the df are the same -for both sets of data, 
a — The null model posits that all of the measured variables are 
unoorrelated, and is used to define the (poorest fitting) end^x)int for the 
TLI and the URFI. 
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Table 3 

Summary of Parameter Estirrtates -For Confirmator/ Factor Analyses of Three Age 
Groups and the Total Group for the 8-factar A Priori Model 



Individual Administration 
Factor Loadings 



Group Administration 
Factor Loadings 



Measured 
Variables 

Appjearance 

2 
3 
4 

Ph^ical 

2 
. 3 
4 

fleers 
1 
2 
3 
4 

Parents 

1 

2 

3 

4. 
Read 

1 

2 

3 

4 
Math 

1 

2 

3 

4 

School 
1 
2 
3 
4 

General 
1 
2 
3 
4 

Mean Factor 
Loading 

Mean Factor 



Mean measured 
variable 
Uniqueness 



Kinder 
garten 


1st 
Grade 


2nd 
Grade 


Total 
Group 


1st 
Grade 


2nd 
Grade 


Total 
Group 


.211 
.394 
.625 
.550 


.592 
.543 
.642 
.709 


.666 
.556 
.889 
.601 


.507 
.503 
.695 
.656 


.590 
.740 
.828 
.698 


.588 
.607 
.744 
.580 


.601 

.796 
.634 


.432 
.478 
.716 
.642 


.539 
.643 
.786 

.828 


.937 
.931 
.578 
.623 


.647 
.696 
.723 
.724 


.703 
.669 
.817 
.773 


.723 
.579 
.723 


.648 
.722 
.681 
.728 


.454 
.752 
.561 
.876 


.564 
.599 
.606 
.679 


.721 
.696 
.744 
.704 


.584 
.702 
.647 
.750 


.411 
.654 
.776 
.876 


.664 
.788 
.705 
.709 


.564 
.727 
.734 
.776 


.550 
.522 
.748 
.675 


.568 
.500 
.743 
.813 


.584 
.434 
.715 
,738 


.568 
.478 
.726 
.758 


.670 
.681 
.771 
.824 


.503 
.554 

.811 


.572 
.623 
.80 

.aii 


.589 
.653 
.743 
.741 


.718 
.775 
o717 
.770 


.652 
• 80G 
.803 
.785 


.659 
.751 
.761 
.784 


.875 
.801 
.77Z 

.884 


.598 
.679 
.882 
.774 


.737 
.717 
.826 
.832 


.4C0 
.747 
.766 
.749 


.562 
.650 
.836 
.696 


.697 
.699 
.868 


.701 
.(312 


.GOT 

.855 
.767 

. /*f / 


.709 
. 000 
.921 
.884 


.697 
.828 
.858 
.819 


•7/1/1 

• w 1 

.580 
.778 
.698 


.619 
.713 
.701 
.626 


.783 
.6y26> 
.639 
.960 


.602 
.653 
.705 
.768 


.546 
.638 
.617 
.732 


.669 
.743 
.807 
.901 


.649 
.703 
.746 
.821 


.673 
.783 
.610 


.679 
.771 
.757 


.Ho/ 

.659 
.702 
.682 


.520 
.660 
.744 
.679 


.860 
.680 
.778 
.709 


.543 
.652 
.748 
.690 


.669 
.665 
.770 
.717 


.610 


.671 


.714 


.663 


.732 


.709 


.722 


.636 




.478 


.596 


.682 


.425 


.579 


.643 


.499 


.448 


.540 


.511 


.429 


.473 
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of four measured variables. Each measured variable was allowed to load on 
only the factor that it was designed to measure and all other factor 
loadings were constrained to.bs zero. For this reason, the factor loadings 
from each analysis ^re presented as a single column even though they 
represent 8 di-Fferent factors. All factor loadings are statistically 
significant (standard errors typically vary between .06 and .09). Whereas 
only the means of factor correlations and measured variable uniquenesses are 
presented, the pattern of correlations in each analysis are generally 
similar to those among scales scores presented in Table 5 (though 
correlations among scales scores are not corrected for unreliability and 
thus tend to be analler). 
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Goodness of Fit Indices -For Alternative Models -for Kindergarten (K>, 1ST 
Grade, 2nd Grade, and the Total Sample (TOT) Based on the Barter and Pike 
(1984) Model 



Goodness of Fit Indicators 



MODEL 


Chi-Sq 


df 


TLI 


URFI 


a 

Null Models 








K 


761.95 


120 


0 


0 


1ST 


970.08 


120 


0 


0 


2ND 


1106.76 


120 


0 


0 


TOT 


2401.15 


120 


0 


0 


1 Factor 


Models 








K 


232.88 


104 


.768 


.799 


1ST 


259.29 


104 


.789 


.817 


2ND 


502.30 


104 


.534 


.596 


TOT 


655.40 


104 


.721 


.758 


2 Factor 


Models 








K 


211.17 


103 


.804 


.831 


1ST 


207.81 


103 


.856 


.877 


2ND 


428.08 


103 


.616 


.671 


TOT 


528.72 


103 


.783 


.813 


4 Factor 


Models 








K 


164.67 


. 98 


.873 


.896 


1ST 


147. IS 


98 


.929 


.942 


2ND 


161.47 


98 


.921 


.936 


TDT 


215.39 


98 


.937 


.949 



Notei^ Chi-Sq = chi-square, TLI = Tucker-Lewis Index, URFI = unbiased 
relative -Fit index. The 4 SDQI -factors (Physical Ability, Peers, Parents, 
School) that most closely match those proposed by Harter and Pike (1984) are 
considered in models summarized here. The two -Factor model corresponds to 
the model proposed by Harter and Pike in which one -Fa-tor (competence) 
incorporates the Physical Ability and School scplIcS, whereas the second 
(acceptance) incorporates the Peers and Parents scales, 
a — The null model posits that all of the measured variables are 
uncorrelated, and is used to define the (ptMrest fitting) endpoint for many 
of the goodness of fit indices. 
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Table 5 

Hultitrait-Hultiiethod Matrices of Relations Between Individually and Group Adainistered Self-Concept Responses For 
1st Graders (above the lain diagonal) and 2nd Graders (beloM the lain aiaoonal) 



Individual 



Group 



Scales Total Scores Scales 



Total Scores 



Phys -- .42 .57 .39 .46 .43 .62 .56 

flppr .27 — .49 .43 .36 .41 .50 .62 

Peer .34 .36 — .55 .41 .43 .57 .61 

Prnt .31 ^48 .46 — .36 .46 .49 .60 

Read .29 .42 .49 .43 — .35 .59 .49 

Kath .19 .27 .42 .40 .37 — .57 .59 

Schl .26 . 43 . 43 . 48 . 59 . 47 — .72 

Genl .36 .57 .53 .54 .45 .49 .54 — 

TNACD .63 .76 .75 .76 .56 .44 .55 .70 
TACD .30 .46 .55 .54 .82 .76 .85 .61 
TSELF .50 .66 .70 .70 .78 .68 .79 .72 

Phys <.53> .09 .13 .11 .18 .02 .09 .17 

fippr .20 <.45> .19 .17 .30 .14 .25 .35 

Peer .30 .31 <.54> .27 .35 .10 .23 .35 

Prnc .09 .24 .17 <.47> .24 .20 .28 .26 

Read .21 .21 .23 . 24 <.61> .19 .36 .23 

Hath -.01 .10 .17 .17 .20 <.55> .28 .22 

Schl .16 .19 .24 .38 .44 .33 <.44> .25 

Genl .22 . 35 . 39 . 33 . 44 . 24 . 34 <.41> 

THACD .38 .38 .37 .36 .38 .16 .30 .39 
TACD .14 .19 .25 .31 .50 .42 .43 .28 
TSELF .28 .31 .34 .37 .50 .34 .41 .37 

Convergent Validities 
Corrected For Unreliability 

1st <.54X.59X.56X.47X.47X.49X.43X.23> <.47X.49X.44> 

2nd <.72><.55X.66X.59X.73X.65X.52X.54> <.58><.60X.56> 

Notej. Convergent validities, the values in < >, refer to agreeient betneen Batching SDOI scales frot the individually 
ad«ini£tBred and group adainistered scales. Convergent validities were also corrected for unreliability (using 
reliability estisates froi Table 1). 



.76 


.62 


.74 


<.40> .24 


.23 


.02 


.30 


-.02 


.21 


.17 


.27 


.19 


.25 


.78 


.52 


.69 


. 14 


<.49> .34 


.22 


.15 


.15 


.13 


.15 


.37 


.19 


.30 


.83 


.57 


.75 


■ to 


.30 


<.44> .21 


.24 


.13 


.16 


.19 


.34 


.20 


.29 


.74 


.53 


.68 


.14 


.21 


.30 <.36> .13 


.14 


.11 


.15 


.31 


.15 


.25 


.51 


.78 


./I 


.14 


f A 




.12 <.40> .05 


.27 


.05 


.16 


.28 


.24 


.55 


.79 


.74 


70 




.25 


.28 


.41 <.41> .35 


.22 


.31 


.45 


.42 


.70 


.87 


.86 




• ov 


.31 


.23 


.44 


.19 <.34> .16 


.34 




,39 


.77 


.74 


.81 


26 


.37 


.37 


.22 


.34 


.18 


.31 <.19> 


.36 


.32 


.37 




.72 


.92 


.26 


.41 


.42 


.26 


.26 


.13 


.21 


.21 


<.42> .23 


.35 


.64 




.94 


.25 


.31 


.28 


.26 


.51 


.27 


.39 


.18 


.34 <.45> .43 


.88 


.93 




.28 


.38 


.37 


.28 


.43 


.22 


.33 


.21 


.41 


.38 <.42> 


.28 


.12 


.21 




.41 


.51 


.47 


.51 


.39 


.53 


.49 


.72 


.55 


.66 


.36 


.29 


.35 


.38 




.64 


.52 


.38 


.47 


.3? 


.57 


.80 


.47 


.68 


.49 


.28 


.42 


.35 


.51 




.72 


.47 


.61 


.56 


.69 


.88 


.64 


.82 


.33 


.30 


,34 


.17 


.36 


.37 




.47 


.63 


.59 


.66 


.83 


.65 


.80 


.30 


.49 


.45 


.31 


.38 


,39 


.46 




.46 


.77 


.44 


.56 


.86 


.77 


.15 


.42 


.33 


.05 


.25 


.31 


.50 


.42 




.64 


.64 


.65 


.81 


.79 


.33 


.50 


,47 


.26 


.44 


.45 


.57 


.72 


.60 




.60 


.63 


.92 


.84 


.45 


.42 


.48 


.42 


.65 


.71 


.57 


.54 


.37 


.56 




.74 


.65 


.75 


<.51> .35 


.47 


.64 


.78 


.79 


.67 


,53 


.40 


.60 


.82 




.71 


.92 


.31 <.55> .49 


.24 


.42 


.45 


.60 


.85 


.78 


.91 


.58 


.60 




.93 


.44 


.52 <.53> 


.46 


.64 


.67 


.70 


.79 


.68 


.87 


.76 


.87 


.92 





er|c 
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Table 6 

&imfnary of Scale Distinctiveness Analyses For Responses to the SDQI 



Mean Correlation Among: 



Sample 
and 

Age Level N 



All Scales 

Scale Factor 
Mean <x Scores Scores 



Selected Scales 

Scale Factor 
Scores Scores 



SDQI (Normative Archive) 



Gr^de 2 


176 


.83 


.55 


.43 


.49 


.37 


6» _de 3 


107 


.76 


.37 


.27 


.30 


.20 


Grade 4 


513 


.86 


.34 


.24 


.23 


.12 


Grade 5 


1,428 


.86 


.27 


.18 


.18 


.08 


Grads 6 


• 1,111 


.87 


.28 


.18 


.17 


.07 


SDQI (Individually Adninistered) 








Grade K 


164 


.71 


.45 


.35 


.47 


.37 


Grade 1 


169 


.79 


.48 


.36 


.48 


.35 


Grade 2 


169 


.81 


.38 


.28 


.30 


.19 


SDQI <Grcxjp ^Administered) 










Grade 1 


113 


.82 


.53 


.43 


.46 


.33 


Grade 2 


158 


.83 


.38 


.29 


.29 


.19 



the normative archive <Marsh, 1988; also see Marsh, 1989) , for the 
indiviciially administered responses, and the group adninistered responses. 
The means of the 28 correlations among all 7 SDQI scales (excluding the 
General scale for purposes of this analysis) and 7 correlations selected a 
priori by Marsh <1989) to be the lowest correlations were computed. Separate 
sets of correlations were computed for the siople scale scores <an 
unweighted average of responses designed to measure each scale) and for the 
factor scores <a weighted average of responses based on the a factor 
analysis that is part of the scoring program described by Marsh, 1988). 
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Table 7 

Sex and Age Effects in the SDQI Scale and Total Scores 



8DQ Score 


Sex 


Year in School 




Effect Size and 
Statistical Significance 


SlXai Scales 




Ki nder- 
garten 


1st 
Grade 


2nd 
Grade 


Age 
Linear 


Sex 


Sex by 

Aoe 

Linear 


Physical 


Boys 
Girls 


52.70 
47.37 


53.09 
48.06 


54.18 
44. 10 


-.03 




-. 10* 


Appearance 


Boys 
Girls 


50.77 
53.50 


49.37 
51.00 


47.54 
47.99 


18** 


.08* 


-.05 


Peers 


Boys 
Girls 


50.77 
50.62 


50.16 
51.36 


48.63 
48.93 


-.08* 


.02 


.01 


Parents 


Boys 
Girls 


48.02 
50.62 


48.66 
50.21 


51.30 
51.03 


.07 


.06 


-.06 


Read 


Boys 
Girls 


48.88 
50.48 


50.64 
50.76 


48.42 
51.47 


.01 


.08* 


.03 


Math 


Boys 
Girls 


51.52 
48.70 


51.07 
50.45 


48.97 
49.61 


-.03 


-.04 


.07 


School 


Boys 
Girls 


51.11 
50.87 


51.44 
50.32 


47.13 
49.50 


-.11** 


.02 


.05 


General 


Boys 
Girls 


49.22 
51.15 


50.22 
50.81 


49.13 
49.54 


-.04 


.05 


-.03 


Total Scores 
















Non-Academic 


Boys 
Girls 


50.74 
50.94 


50.33 

50.33 


50.29 
47.40 


-.08* 


-.05 


-.06 


Academic 


Boys 
Girls 


50.59 
50.01 


51.24 
50.61 


47.85 
50.24 


-.05 


.02 


.06 


Total Ssl-f 


Boys 
Girls 


50.72 
50.46 


50.91 
50.53 


48.84 
48.88 


-.07 


.01 


.01 



Note. All SDQI scores were standardized to have Mean = 50 and SD = 10 across 
the total sanple. The effects of the quadratic component of age and its 
interaction with sex were also tested, but they were ex.-:luded because they 
were not significantly related to any of the self-concept scores. Effect 
sizes are standardized beta weights. 
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< .05; ** p < ,01. 
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